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Abstract 

In this paper, we introduce an efficient interference alignment (IA) algorithm exploiting partially 
coordinated transmit precoding to improve the number of concurrent interference-free transmissions, i.e., 
the multiplexing gain, in multicell downlink. The proposed coordination model is such that each base-station 
simultaneously transmits to two users and each user is served by two base-stations. First, we show in a 
if -user system operating at the information theoretic upper bound of degrees of freedom (DOF), the generic 
IA is proper when K < 3, whereas the proposed partially coordinated IA is proper when K < 5. Then, 
we derive a non-iterative, i.e., one shot, IA algorithm for the proposed scheme when K < 5. We show that 
for a given latency, the backhaul data rate requirement of the proposed method grows linearly with K. 
Monte-Carlo simulation results show that the proposed one-shot algorithm offers higher system throughput 
than the iterative IA at practical SNR levels. 

Index Terms 

Interference Alignment, Coordinated Precoding, Multiplexing Gain, Feasibility. 

I. Introduction 

The problem of inter-cell interference in cellular networks has been the major impediment in delivering 
a consistent level of quality of service to geographically distributed users with an efficiently high spectral 
efficiency in terms of bits/sec/Hz/user. In a recent attempt to break the interference barrier, the information 
theoretic concept of interference alignment (IA) was used. This concept was initially introduced by 
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Maddah-Ali et. al to solve the MIMO X-channel fl]. In 0, authors applied the concept of IA to K- 
user interference channel (IC) and obtained the theoretical upper bound of multiplexing gain or degrees of 
freedom (DOF). However, the practical development of IA in cellular networks is highly complex, e.g., an 
iterative process has been suggested in to design the transmit and receive beamformers. In this paper, 
we modify the abstract distributed IA J3]|, by introducing a level of structured coordination amongst the 
transmitters, i.e., base-stations (BSs), in the cellular network, to improve the achievable spectral efficiency 
at the cost of a limited overhead of backhaul processing. In our proposed coordination model, each user 
receives its desired signal from its dedicated BS, as the Primary transmitter, and a coordinating BS, as the 
secondary transmitter. In this chain of coordinated transmissions the first BS in the network coordinates 
with the second BS, the second BS coordinates with the third BS and so on, and eventually, the last 
BS coordinates with the first BS to complete the coordination cycle. In a sense, this transmission model 
is similar to Wyner circular model Il4ll. lfl2ll as each user is supported by two BSs, but slightly departs 
from this model as each user is modeled to receive interference from all the BSs in the network. The 
coordination link between two BSs is uni-directional, e.g., the first BS shares its channel state information 
(CSI) with the second BS and serves as the secondary transmitter to the user of the second BS. As a 
result, each BS transmits to two users, i.e., to its own user as the primary transmitter and to the user of 
the coordinated BS as the secondary transmitter. 

Through feasibility analysis, we compare the proposed coordination scheme with the generic IA. We 
establish some sufficient conditions for obtaining a proper system that can attain a feasible IA solution 
according to [6J. We show that, operating at half of the total available DOF, i.e., the information theoretic 
upper bound on DOF, the generic IA is proper for up to 3-user system, whereas the proposed partially 
coordinated scheme is proper for up to 5-user system for the same configuration, i.e., with the same number 
of transmit/receive antennas and the same number of DOF. Then, exploiting the rank deficiency of the 
interference covariance matrix of the reciprocal uplink channel, we propose an algorithm that implements 
the IA solution in the proposed coordination model in one shot. We establish a DOF upper bound for 
obtaining the one shot, i.e., non-iterative, solution and analyze the system configurations operating below 
the one shot upper bound. Furthermore, we analyze the backhaul overhead requirement of the proposed 
coordination model and show that the backhaul overhead in our model increases linearly with increase in 



3 

the number of participating BSs. Simulation results confirm that our proposed scheme improves the sum 
rate performance compared to the iterative solution in 0, for limited usage of the backhaul. 
This paper is organized as follows. In section II, we introduce and formulate the proposed coordination 
scheme. In section III, we analyze and compare the feasibility of our proposed scheme with the generic 
IA and introduce a one shot IA solution for the proposed coordination scheme. Moreover, we compare the 
backhaul overhead in our proposed partially coordinated system with systems having fully coordinating 
BSs in terms of backhaul data rate requirement. In section IV, we support our analysis with simulation 
results and discussions. Finally, section V concludes this paper. 

The following notations are used throughout this paper. Bold lowercase and uppercase letters, e.g., x 
and X, represent vectors and matrices, respectively. The i th dimension of x is shown by and 
indicates the element at row % and column j, i.e., (i,j) th element, in X. Similarly, after partitioning a 
matrix into submatrices, we use calligraphic letter AV, to define the submatrix at (i,j) th element of X. 
The notation rem(a, b) and mod(-) c respectively represent the the remainder of the fraction a/b and the 
modulo-arithmetic operation over c, where a, b and c are all integers. We use the operators |_-J , [■], 
rank(-), Null(-), Trace (•) and (•, •) to compute the floor, ceil, rank, null space, trace and dot product 
operations of their respective arguments. The operator Diag(-) generates a block-diagonal matrix using 
the elements of its argument. The superscripts T , * and H represent the transpose, complex conjugate and 
conjugate transpose operations, respectively. 

II. System Description and Formulation 

We consider a partially coordinated downlink transmission in a MIMO cellular system with K BSs. 
In our proposed coordination, BSs are connected in pairs and in succession via backhaul links, as shown 
in Fig. [I] We assume that the backhaul network is based on a line or a ring topology with the k th BS, 
k = 1,2, .. .,K, as the k th node in the line or the ring. The k th BS has transmit antennas and its 
corresponding user terminal has receiving antennas. We also assume time-division duplex (TDD) 
communication with synchronized time slots to ensure channel reciprocity and to use uplink CSI in 
the downlink. The successive pair-wise coordination among BSs is such that the k th BS establishes a 
coordination link with BS k', where k' = mod(/c — 2) K + 1, in two steps as follows. First, BS k! shares its 



4 

CSI with BS k. Next, BS k exploits this CSI to compute its transmit beamformer G C nkXdk , where 
nk = nk + rik> and dk is the DOF of user k. Then, is used to transmit dk independent data streams 
towards user k through the transmit antennas of both BSs k and k'. In this coordination setup, the BSs k 
and k' serve as the primary and the secondary transmitters for user k, respectively. We assume flat fading 
channel that can be achieved by employing MIMO orthogonal frequency division multiplexing (OFDM) 
JH as the baseband modulation. Moreover, we assume no intracell interference within a cell, as this can 
be obtained by using orthogonal frequency division multiple access (OFDMA). If the coordination among 
BSs is ignored then this system reduces to the standard K-user IC with nik X nk MIMO links. In the 
proposed scheme the overall received signal y = [yf y<f . . . y^] T , with y k G C dkXl as the k th user's 
received signal, can be expressed as 

y = XJ H (HVx + n) = U^HVx + XJ H n, (1) 

where x = [xf xj . . . x^] T and n = [nf . . . ir^] T , with x fe G C dfcXl and n fe G C mfeXl , respectively, 
representing the transmitted symbol vector and the noise vector for user k. Furthermore, H G C mxn , with 
rh = J2k=i m k an d n = J2k=i n ki i s the overall channel matrix which is composed of K 2 submatrices 
Hij G <C miXn i , i,j = 1, 2, . . . , K, as the (i,j) th element of H. Hij represents the m 8 x rij MIMO channel 
between BS j and user i. In ([T]), the overall receiver beamformer matrix is indicated by a block-diagonal 
matrix U = Diag(Ui, U 2 , . . . , Ujj-), where the columns of G C nikXdk form the receiver beamforming 
vectors of user k. The overall transmit beamforming matrix V G C nxd in ([T|), with d = J2k=i ^ s gi ven 
by 
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where the columns of V fc G C nkXdk and V fc G C nfe/Xdfc are the distributed transmit beamforming vectors 
towards user k through the antennas of BS k and BS k', respectively. Hence, the received vector by user 



k can be written as 

Desired Signal 



H C ^ 



K 



(^fciVi + Ww'ViJxi+nfc), fc = l,2,...,A-, (2) 



> 



Interference Signal 



where i' = mod(i — 2) K + 1. The solution to Q or (|2]) is attained by selecting XJ k , V fc and V fc such that, 

Uf (U ki Vi + =0 V fc ^ i and 

rankfuf (H fcfc V fc + H fcfc ,V fe )) =4 for fc = 1, 2, . . . , K (3) 



To solve for the transmit and the receive beamformers in (|3]), we first transform ([T]) to an equivalent 
interference channel problem as follows. 

A. An Equivalent Interference Channel Model 

Let us define the permutation matrix P = [Pi P 2 ... Pr], where P& = [l& 1^] for k — 1, and 
P fc = [lfc/ l fc ] for Jfe = 2, . . . , if with fc' = mod(A; -2) K + 1 and 1, G M" xrtl given as, 



1, 



1,2 AT. (4) 



n 1 o 1 t 1 o 1 o 1 

u l ■ ■ ■ u i-l J -i u i+l ■ ■ ■ U K 

In Q, Xj is rij x identity matrix and Oj, for j = 1, 2, . . . , K and j 7^ z, represents the x rii matrix 
with all zero elements. We define an equivalent interference channel matrix G = HP which can be 
partitioned into K 2 submatrices Qij G C miX ™ J , with rij = rij + rif, and i,j G {1,2, . . . , K}, where, 

[Hi,i 7~Li,K} , j = 1 

(5) 

\Hi,o-i H-ij] > 3 ' — 2, • • • K, 



ft, j 



represents the submatrix at the (i,j) th location in G. Next, we define the block-diagonal matrix W 
Diag(Wi, W 2 , . . . , *Wk), where its j th element Wj G C njXdj is given as, 



[VjVjf, j = l 
W 3 - = <( L 3 1 J (6) 

[VjVjf, j = 2,...K. 

Using (|6]), it can be easily verified that W = P T V. Therefore, GW = HPP T V = HV, as the 
permutation matrix satisfies PP T = I. Using this relation, we transform our partially coordinated multicell 
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model into a K-user m k x n k MIMO IC problem, where is the transmit beamformer for user k. Thus, 
([T]) and ([2]) can be rewritten as, 

y = U^GWx + V H n, (7) 

and 

K 

y k = Uf \Q k ,k W fc x fc + ^ ^WjXj + n fc J , (8) 

respectively. Similarly, according to the IA principles 0, the design constraints in ([3]) become 

Uf g k ,iWi = 0, V fc ^ i and 
rank(Uf £ M W fe ) = 4, for i, k = 1, 2, . . . , K (9) 

III. Interference Alignment with Partially Coordinated Precoding 

In this section, first, we analyze the sufficient conditions for obtaining a proper uncoordinated generic 
interference channel system in terms of maximum number of users supported at the information theoretic 
upper bound on DOF and compare it with our proposed partial coordination scheme, in ([9]). 

A. Feasibility Analysis 

According to 0, a generic if-user IC with m k x n k MIMO links, denoted as a Y\k=i ( m k x n fc, d/.), is 
a proper system if the number of variables involved in each and every subset of equations is greater than 
or equal to the number of equations in that subset. Based on the intuitive discussion in |6l (see sec. V 
and VI), authors argue that a proper IC system is likely to attain a feasible IA solution. In this context, a 
symmetric IC system, with m k = m, n k = n and d k = d, V k, described as (m x n, d) K , is proper if the 
total number of equations is less than or equal to the total number of variables, i.e., m + n > (K + l)d. 
Moreover, in [6], the various groups of IC systems are classified such that within a group m + n = c, 
where c is a constant, and min(m, n) > d, V k. Based on the classification of a group of systems, the 
authors in (6) establish a rule of thumb which states that if any system within a group is proper then 
almost all the systems of that group are also proper and vice versa. 

Following the definitions in our equivalent IC system in Q can be modeled as fT-i ( m fc x ^k,d k ). 
Furthermore, the information theoretic upper bound for DOF, introduced in 0, can be tightly approximated 



as d = J2k=i ( m k + n fc ~ rem(m fc + n k , 2))/4 for the general fT-user IC and d = [ 2 2 j = |_^( c — 
rem(c, 2))/4j, for the symmetric .fT-user case, where rem(c, 2) is or 1 for an even or odd values of c, 
respectively. Note that d is not necessarily an integer multiple of K. This requires to privilege a number 
of users by allowing them to operate at a higher integer number of DOF, i.e., \d/K~\ , than the others, i.e., 
operating at [d/K\. However, this may not be the best policy of allocating the DOF resources amongst 
the users due to the fairness considerations as well as the fact that different users experience different 
fading conditions in the channel. Clearly, assigning a larger DOF to a user facing an aggressive fading 
condition in the channel is not an efficient policy, as it may require a higher transmit power to deliver 
a prescribed level of data rate per stream. In the following, we propose a time-sharing strategy over a 
number of time-slots so that the available DOF resources are equally distributed, when averaged over all 
time slots, amongst all K users. 

Theorem 1: (A uniformly time-shared distribution of DOF in the generic IA) A system of i^-user 
asymmetric interference channel described by Ylk=i ( m x n ><4)' where user k is operating at an integer 
number of d k of DOF so that d = J2k=i dk — L-^( c ~~ rem(c, 2))/4j is the information theoretic upper 
bound, can be described as an equivalent symmetric system of (m x n,d/K) K defined under a time 
sharing schedule over a span of (^) time slots, where a = rem(d, K). 

Proof: Let r = (^) be the total number of allocated time slots, a be the remaining number of DOF 
to be shared uniformly among K users, and 6 = j^r = (^Ti)- I n the equivalent symmetric system each 
user k operates at a DOF of for the duration of 9 time slots and at a DOF of [ j^J for the remaining 
t — 9 time slots. The distribution of DOF is attained such that in each of the r time slots the total number 
of independent data streams in the interference channel system is d. Over the span of r time slots, each 
user k attains a total of 6\£] +(r-9) [£\ = t[^\ + 9 DOF. Effectively, the DOF per time slot for each 
user k, i.e., d k , is given as 



r[i\+9 J. a 
K K 



d k = -^i — = L^J + 



d — a a d 
= — + K = K- (10) 

Thus, over the span of r time slots the system can be modeled as (m x n, d/K) K system. ■ 



Example 1: Consider an asymmetric 5-user interference channel with 3x3 MIMO links where each 
user operates at its information theoretic upper bound of DOF, i.e., d — [5 x 6/4J = 7. This system can 
be described as (3 x 3,2) 2 (3 x 3, l) 3 . Applying Theorem [I] one can easily verify that after assigning 
one DOF per user the remaining number of DOF to be allocated is a = 2, the total number of required 
time slots is r = (I) = 10 and 9 = (10)2/5 = 4. Hence, over the duration of 10 time slots, each user 
operates at [7/5] = 2 DOF for 4 time slots and at [7/5\ = 1 DOF for the remaining 6 time slots, i.e., 
a total number of 14 DOF. Therefore over a span of 10 time slots, the equivalent symmetric system can 
be modeled as (3 x 3, 7/5) 5 , where each user attains 7/5 DOF per time slot. 

In the sequel, we calculate the upper bounds on the number of users that can be served at the information 
theoretic limit of DOF in the generic and the proposed partially coordinated proper systems. 

Corollary 2: (An upper bound on K for generic IA) A system of K-user interference channel described 
by Ylk=i ( m x n > dk) that operates at the information theoretic upper bound d = [^(c— rem(c, 2))J , where 
c = m + n, min(m, n) > \d/K~\, and rem(c, 2) is either or 1 for even or odd values of c, respectively, 
is proper if 

K < 3 + 4 rem(c ' 2 ' . (1!) 
c — rem(c, 2) 

Proof: An asymmetric system fT-i ( m x n > ^fc) can be modeled as an equivalent symmetric system 
(m x n, d) K , where d = dj K ~ |(c — rem(c, 2)), using the time sharing concept of DOF, developed in 
Theorem [TJ To obtain a proper symmetric system, we require the total number of equations N e and the 
total number of variables N v satisfy N e < N v , i.e., 

Kd < c - d 

K(c- rem(c, 2)) < 3c + rem(c, 2) 

rem(c, 2) 



K < 3 + 4- 



c — rem(c, 2) 

which is a sufficient condition for obtaining a proper system. 



Remark 1: It can be easily verified from ( fTT] ) that the system n^Li ( m x is proper if K < 4. 

The equality of K = 4 holds only for c = 5 and for c ^ 5, we have K < 3. The exceptional case of 
K = 4 with c = 5 applies to the group of symmetric systems (1 x 4, l) 4 , (2 x 3, l) 4 , (3 x 2, l) 4 , (4x1, l) 4 
systems that are given as Examples 1, 3 and 5 in [fSJ. Furthermore, generic IC systems operating at their 



information theoretic upper bound of DOF with K > 4, i.e., with an exception of c = 5 for K = 4, are 
not proper and, hence, are not feasible to attain the IA solution. This fact is confirmed in Fig. 4 of 0, 
where the DOF upper bound cannot be achieved for four users with 4 and 5 antenna cases. We will also 



confirm this through simulation results presented in section IV 

Next, we establish an upper bound on K for a proper partially coordinated IA system. 

Corollary 3: (An upper bound on K for IA with partial coordination) A system of fC-user partially 



coordinated channel described by the equivalent interference channel representation Ylk=i ( m x 2w, djt), 



that operates at the information theoretic upper bound d = |_if( c — rem ( c i2))J, where c = m + n, 
rnin(m, n) > \d/K] and rem(c, 2) is either or 1 for even or odd values of c, respectively, is proper if 

m + n — rem(c, 2) 

Proof: An asymmetric system Ylk=i ( m x dk) can be modeled as an equivalent symmetric system 
(m x 2n, d) K , where d — d/K ~ |(c — rem(c, 2)), using the time sharing concept of DOF, developed in 
Theorem [I] One can easily verify that, here, the total number of equations N e = Kd 2 (K — 1) and the total 
number of variables N v = z~2f=i <^( m + 2n — 2d) = d(J2k=i ( m + n ) + SfcLi n ~ ^Kd) = Kd(c+n — 2d). 
To obtain a proper system, we require N e < N v , i.e., 

Kd < c + n — d 

n + rem(c, 2) 



K < 3 + 4 
< 3 + 4 



c — rem(c, 2) 
n + rem(c, 2) 



m + n — rem(c, 2) 

which is a sufficient condition for obtaining a proper system of IA with partial coordination. ■ 
Remark 2: It can be easily verified from ( [12] ) that the equivalent IC system Ylk=i ( m x ^w, dk) is proper 



if K < 5, when m = n ± rem(c, 2), V k, and its information theoretic upper bounds on DOF for K = 3, 

K = 4 and K — 5 are given as d = |_|(c — rem(c, 2))J = [3n/2j, d — \_(c — rem(c, 2))J = 2n and 
d — [|(c — rem(c, 2))J = \Jm/2\, respectively. 

In the sequel, we introduce a one- shot algorithm that achieves DOF upper limits within d < 2n by IA 
with partial coordination and discuss, in more details, the achievability of various DOF configurations for 
K = 3, K = 4 and K = 5. 
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B. A one shot solution for IA with Partial Coordination Scheme 

Here, we describe a one shot IA solution to find the receive and the transmit beamformers, i.e., Uj £ 
C niiXdi , % = 1,2,..., if, and W fc £ c flfcXrffc , = 1,2,..., if, respectively, for the proposed partial 
coordination scheme in two steps. First, we design Uj, V i, for any i th receive beamformer such that the 
signal power received by each user i in the original network is maximized. Then, using these receive 
beamformers in the reciprocal network, where roles of the transmitters and the receivers are reversed, 
we design Wfc, V k, such that the interference leakage at the receiving ends of the reciprocal network 
is minimized to zero. These steps are described as follows. Note that in the following analysis, we use 
the notation defined in to demonstrate the variables in the reciprocal network, i.e., in the reverse link, 
with a left arrow on top. 

Step 1: In the original network, the desired signal power at receiver i due to the partially coordinated 
transmission from BSs i and %' is given by 

Si = Trace (Uf Q,U,) , (13) 

where 

is the signal covariance matrix at user i and is the power of message signal Xj transmitted through 
BSs i and i', i.e., the i th transmitter in the equivalent IC model. First of all, the i th transmitter computes 
the singular value decomposition of Qi ti £ C miXn % % = 1,2, ... ,K, as Q iyi = FjSjM^, where Fj = 
[f| l] . . . fgj, = Diag(af ] . . . a { X) and M, = [mf 1 . . . mlj, and initializes W* = M. t = [mf ] . . . mg], 
i.e., the first di columns of Mj. Then, the receiver i solves the optimization problem max Si 

Ui-.rriiXdi, U,Uf=I d . 



to find the receive beamformers Uj that maximizes S{ in ( |T3| ). It can be easily verified that ( [13] ) is 
maximized when Uj = [fj*' . . -fl], i.e., the first di columns of Fj, and the maximum received signal 
power Sf 13 * is calculated as 

sr x = ^X>f) 2 d4) 

which is equivalent to the received signal power at user i with di DOF in the single user scenario. 

Step 2: In the reciprocal network, where the k th user is the k th transmitter, i.e., tv fc = XJ k , and the 
coordinating BSs k and k! form the k th receiver, i.e., = W fc , the total interference leakage at the k th 
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receiver due to all undesired transmitters 1,1 ^ k, is given by 

f k = Trace (tjf tj*tj fc ) = Trace (wf t^W fc ) , 

where 



A' 



di 

i=i,i^k 1 



is the interference covariance matrix at receiver fc, P; is the power of the message signal j transmitted 



in the reverse link from user I and due to channel reciprocity Q k,i = Qf k , V I, k. The k th receiver in 



the reverse link solves the optimization problem min I k to find that minimizes the 

W k :n k xd k , WfW fe =I dfc 

interference leakage due to all undesired transmissions, i.e., all users except user k. This optimization 



problem designs W fc to be the eigenvectors corresponding to d k smallest eigenvalues of Qfc. Note that 
the interference covariance matrix at the k th receiver in the reciprocal channel, i.e., Q fe e C nkXTbk has a 



rank of r k = ^2 i=1)ijik d { = d - d k . By defining a k = Nullity(Q fc ) = n k - r k , where Nullity(Q fc ) is the 
number of dimensions in the null-space of t^ k Q, one can easily verify that d k < a k , V k, when d < n, 
where n = min(n 1; . . . ,n K ) for asymmetric systems and n = 2n for symmetric systems. Hence, is 
chosen in the null-space of Q fc to suppress the interference leakage 7~ fc in the reciprocal network. 
Next, we show that choosing W fc in the null-space of t^ fc and using its columns as the transmit beam- 
forming vectors in the original network will suppress the interference to other unintended users I ^ k. 
Let us decompose Q k as <3fc = R fc Rf , where R fc = G k \J k , G k = [Qf k . . . Q k _^ k Q k+1 , k ■ ■ ■ <3§ >k ] 
and U fc = Diag^L-.U^x U W ...U K ). Since W^ k W k = (R^W fc )^RfW fc = 0, therefore, 
Rf W fc = 0. Finally, by expanding R fc in Rf W k = 0, we arrive at Uf Gi !k W k = 0, V / ^ k, 
I = 1, 2, . . . , K, which satisfies the first condition of the interference alignment solution in (|9]). Therefore, 
in this way, BSs k and k' can jointly transmit the message signal x fc to user k with d k DOF without 
causing interference to the other users. 

Due to the update of W 8 , i = 1, 2, . . . , K, in step 2, the received signal power at the i th user of the 



i- ( ,4- 



original network deviates from the single user performance, i.e., Sf 1 ^, in (14), as follows. Starting with 
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( fT3| ) and substituting for Qj with updated W, values calculated in step 2 and Uj = F i? we can write 

Pi 



Si = j Trace (#f M W,W? SgF, 



rf J Trace ^ffe^wf (wjf^ 



But, it can be easily verified that £^F;Ff £ M = MjSfMf , therefore 



^EC^EKmf.wW)! 2 , (15) 



3=1 «=1 



where Sj = Diag(cx[ . . . a d _) and Wj = [w^ . . . w d ]. A comparison of ( fT4[ ) and ( |T5| ) shows that each term 
(cr^) 2 , i.e., the channel gain corresponding to the j th largest singular values of Q^, is degraded by a factor 
of Ylfti K 1 * 1 ^ w f )l ' where (m^,wj^) is the projection of transmit beamformer wf\ I — 1, . . . , di, on 
mj i] . Since WfW< = 1^, V z", and MfM; = I d ., V z, therefore, |(mj* ] , wf ] )| < I. However, despite this 



degradation, the Monte-Carlo simulation results, described in section IV show that the one shot partially 



coordinated IA offers higher sum rate than the distributed IA in similar scenarios. 

C. Achievable gains in IA with partial coordination 

In a symmetric distributed IA system with c = m + n and m = n + rem(c, 2), V k, the achievable 
DOF information theoretic upper bound satisfies d = \K(c — rem(c, 2))/4j > n for K > 2, whereas in 
partially coordinated IA, it is limited by d < 2n, for K < 4, as established in Remark 2. Furthermore, for 
a symmetric distributed IA, — dk = n — d < or equivalently < dk, for K > 2, because has a 
rank of r\ = d — dk and contains only n columns. This implies that cannot be designed in one shot 
in the null-space of Qfc and an unpredictable number of iterations is needed to reduce the rank of 
These iterations continue until the required number of dimensions of dk is reached for the null-space of 
Qfc. For instance, the distributed IA in a symmetric interference channel system with K = 3, m = n = 2, 
d = 3, i.e., dk — 1, V k, requires iterative adjustment of transmit beamformers until r k = 1 is reached 
at each receiver. Whereas, for a symmetric IA system with partial coordination a DOF of d that satisfies 
d < 2n can be achieved in one shot because d^ < a*., i.e, the null space of Q fc has enough dimensions 
to contain d k DOF, or d k independent streams per user k, from the starting point of iterations. Let the 
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columns of G C 2nxafe form a set of orthonormal basis for the null-space of C» fe and the columns of 
Wfe G C 2nxdk be selected from the columns of T*.. Then, for each user k, we have the flexibility of 
choosing a set of d k columns from E k = (^ fc ) possible selections of sets for Clearly, the optimum 



set would be a set of d k columns that maximizes the received signal power at user k according to ( [15] ), 
i.e., max S k . As an alternative approach, one can set the columns of such that the geometric 

W t =[wfl..wW] 

mean of the individual SNR (see section 2.5 on p. 33 in flU) values of all the dk parallel independent 
channels of user k is maximized. 

The steps of the one-shot IA algorithm to implement IC systems with the proposed partial coordination 
are summarized in Algorithm [TJ as follows. 

Algorithm 1 One shot Algorithm for IA with Partially Coordinated Transmit Precoding 
l: First, design the receiver beamformers TJ k , k = 1, 2, . . . , K. Set the d th vector of d = 1, 2, . . . , dk, 

as Uk,d = ^d(Qk,kGkk)^ where v d (~X.) represents the eigenvector that corresponds to the d th largest 

eigenvalue of X. 

2: At each user k, using the CSI of BS k', i.e., H# = [Hi t k' ■ ■ ■ HK,k']> construct = [Gi,k ■ ■ ■ Gic,k] 

according to ([5]). Compute Qjt and T k = Null(Q fc ). 
3: For each user k, define A* G C™ fcXdfc , for i = 1,2, . . . ,E K , E k = (* fc ), such that each matrix A, 

contains a unique selection of d k columns of TV 
4: For each user k, compute Bj = UkGk,kA%> i — 1)2, ...,Ek, and find the eigenvalues of B, as 

A^,...,A^. Compute the geometric SNR coefficient 7; = n^=i ^ an( ^ set = A max , 

where A max is the matrix Aj corresponding to the largest value of 
5: Finally, set the primary and secondary transmit beamforming vectors V fc and V fc , respectively, for 

each user k, according to ([6]). 



Definition 1: We call a system satisfying d < n, where n = min(ni, . . . , uk) in an asymmetric system 
and n = 2n in a symmetric system, or equivalently d k < a k , V k, as flexible due to the flexibility of 
selecting in this system. Similarly, a system with d = n or equivalently d k = a k , V k, is termed as 
rigid. In a rigid system E k = 1 and therefore, Wjt = T k , V k. 

In the following examples, we analyze various symmetric system configurations for K = 3, K = 4 and 
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K = 5 in the proposed partially coordinated scenario and design the transmit beamformers W k such that 
the overall throughput is maximized. 

Example 2 (K = 3): For K = 3 symmetric systems, the information theoretic DOF upper limit is less 
than the number of columns of Q^, V k, i.e., d = [3n/2\ < 2n, and therefore, dk < a>k, V k. Hence, 
partially coordinated IA can achieve this system with d DOF in one-shot. Furthermore, we can attain higher 
throughput for K = 3 by increasing the number of independent data streams within |_3n/2j < d < 2n 
interval. For example, with m = n = 2, the proposed one shot solution achieves d = 3 and d = 4, 
whereas, the distributed IA algorithm ^ can only achieve d = 3 iteratively. Likewise, with m = n = 3, 
the proposed partially coordinated IA achieves d = 4, d = 5 and d = 6 in one-shot, whereas, the distributed 
LA can only attain d = 4 through iterative process. 

Example 3 (K = 4): As established in Remark 1, a conventional IC system with K > 4 is not proper [] 
at the information theoretic upper bound of DOF and, hence, cannot be achieved by the generic IA. This 
is confirmed in J3), where the distributed IA achieves K = 4 with m = n = 4 at a total of 6 degrees of 
freedom out of a theoretically possible 8 and K = 4 with m = n = 5 at a total of 8 degrees of freedom 
out of a theoretically possible 10, (see Fig. 4 in 0). Whereas, these theoretical upper bounds can be 
attained in one shot, as d < 2n in both cases, by introducing the proposed partial coordination into the 
interference channel. Furthermore, the iterative IA in [fJl is unable to solve the conventional IC systems 
with K = 4, m = n = 2dXd = A and K = 4, m = n = 3atd = 6, whereas the introduced IA with 
partial coordination achieves both of these systems in one shot, because they both satisfy d < 2n. 

Example 4 (K = 5): According to Corollary [5j an IC system using the proposed partial coordination 
is proper for K = 5 and can attain the upper bound on DOF, but, since d = [5n/2\ > 2n, it cannot be 
achieved in one shot. However, the iterative IA [O algorithm operating on the equivalent IC model, i.e., 
Ilfc=i ( m x dk) can achieve it using the IA principles in ((8]) and (|9]). The proposed one-shot solution 
can implement this system with a number of DOF that is under its theoretical upper bound, i.e., as a 
rigid system with a total number of degrees of freedom of at most 2n. For instance, the distributed IA 
01 achieves the equivalent IC system of Ylk=i ( m x dk) with m = n = 2 with 5 streams, i.e., at 
d = 5, iteratively, while, the proposed one-shot algorithm implements it with 4 streams, i.e., at d = 4. 

'the exceptional feasible case for K = 4 with c = ?n + ?i = 5is identified in Remark 1. 
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In one-shot implementation, these 4 streams can be shared among 5 user by employing the time sharing 
method described in Theorem [TJ such that over the span of 5 time slots each user k operates at dk = 4/5 
degrees of freedom. Note that within each one of the allocated 5 time slots, only 4 users are allowed to 
transmit while one user remains silent. This scenario for one-shot implementation can be considered as a 
special case for the K = 5 system as it is effectively solved by modeling it as a K = 4 system per time 
slot. As another example, the distributed IA [3] achieves the equivalent IC system of Ylk=i ( m x ^fc) 
with m = n = 3 with 7 streams, i.e., at d = 7, through an iterative process, whereas, the one-shot 
algorithm implements it as a rigid system with 6 streams, i.e., d = 6,. 



These discussions and examples are confirmed through Monte-Carlo simulation results in section IV 



D. Overhead of CSI exchange in the backhaul 

In practice, the backhaul network is built using dedicated point-to-point links (which can either be 
wired, e.g., fibre optic, or wireless, e.g., microwave) connecting all the BSs in a line or a ring topology. 
The implementation details of these topologies in wireless backhaul are discussed in chapter 6 of |[i~0ll 
The idea of using the line or the ring topology to model the multicell system was introduced by Wyner 
flU. This idea, commonly referred as Wyner linear or Wyner circular model, has been studied extensively 
in the context of joint multicell processing |fTT) . |fT2ll . In the following, we examine the data rate required 
for the exchange of CSI in the backhaul in the proposed coordination model. 

Let At be the overall permissible latency to exchange CSI in the backhaul. Let R be the required data 
rate to exchange the CSI of a single MIMO link, i.e., Hij, V i,j, between two coordinating BSs during 
At. Hence, in the ring model, a data rate of KR is required in each and every BS-to-BS link to exchange 
the multiuser CSI, i.e., Hj = \H\ t i ■ ■ ■ Hk^}, i = 1, 2, . . . , K, in Fig. [Tj within At. Whereas, in the line 
model, the inter-BS data rate of 2KR is required due to the broken link between the first and the K th 
coordinating BSs and the fact that the CSI of the K th BS, i.e., H^, should travel through all other links 
to reach the first BS. Therefore, the data rate requirement for CSI exchange in the backhaul grows linearly 
with K in the proposed coordination structure in Fig. [TJ Whereas, in a system with fully coordinating 
BSs, e.g., [|T3l - 031, it can be easily verified in a similar way that the data rate requirement for overall 
CSI exchange among all BSs within the same allowed latency interval of At requires a data rate of K 2 R 
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and K(K — 1)R in the line and the ring based backhaul models, respectively. 

IV. Simulation Results 

In this section, we report our simulation results on the achievable sum rate of the proposed partial coordi- 
nation scheme and compare them with those of generic IC model, employing distributed IA algorithm 0, 
and full coordination, i.e., block-diagonalization (BD) based zero-forcing (ZF) scheme [9]. Transmitting 
BSs use the same transmit power and each BS equally distributes its power across its independent data 
streams. Figs. [2]j7] compare the achievable sum rate of different schemes for K = 3, K = 4 and K = 5, 
where 2x2 and 3x3 MIMO links are established between a BS and a single user in each cell. Results 
confirm that the BD based ZF in a full coordinated system, where separate base-stations form a giant 
single BS, achieves the highest sum rates amongst the other schemes. These results also confirm that the 
proposed one shot IA outperforms the distributed IA in terms of sum rate performance. It should be noted 
that degradation of performance of the distributed IA is because of the lack of coordination among the 
precoders. Furthermore, we compare the flexible and rigid systems for K = 3, discussed as Example 2 
in section III-C In Fig. [2j i.e., K = 3 with 2x2 MIMO links, the flexible and rigid systems operate at 



d — 3 and d — 4, respectively. Similarly, in Fig. [3J i.e., K = 3 with 3x3 MIMO links, we have two 
flexible systems, with d — 4 and d — 5, and the rigid one operates at d — 6. Results confirm that systems 
with higher d offer higher sum rate at high SNR as the slope of their sum rate curves grow faster at high 
SNR, as shown in Figs. |2]and[3j However, we observe that at lower SNR the flexible system, with lower 
DOF, outperforms the rigid one. This is due to the fact that in the flexible system the geometric mean 
of the individual SNRs is maximized through the selection of the transmit beamformers, whereas, in the 
rigid one the transmit beamformers are fixed. In Fig. [2J the rigid system, i.e., d = 4, offers higher sum 
rate compared to the flexible one, with d — 3, at SNRs higher than 15.5 dB. Similarly, in Fig. [3J the rigid 
system, i.e., d = 6, outperforms the flexible systems of d = 4 and d — 5 at SNRs higher than 6 dB and 
13.5 dB, respectively. In Figs. |4]j7j the sum rate curves of distributed IA [3] in K = 4 and K = 5 with no 
coordination at the precoders saturate at higher SNRs due to leakage of interference into the interference 
free dimensions carrying the information streams. This confirms the result from [0 as these improper 
systems at K = 4 and K = 5, defined by Corollary^ are unable to attain a feasible IA solution. In Figs. 
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[4] and [5j i.e., K = 4 system with 2x2 and 3x3 MIMO links, the sum rate curves of the proposed one 
shot partially coordinated IA operating the information theoretic DOF upper bound, i.e., at d — 4 and 
d = 6, respectively, grows linearly with increasing SNR. This confirms our discussion in Example 3 as 
the proposed partially coordinated scheme solves K = 4 at the information theoretic DOF upper bound 
in one shot. Following the discussion from Example 4, in Fig. |6j i.e., K = 5 with 2x2 MIMO links, 
the distributed IA [0 achieves the information theoretic DOF upper bound at d = 5, i.e., d\~ = 1, V k, in 
the partially coordinated system whereas, our one shot solution achieves d = 4, i.e., d& = 4/5, V fe, as 
described in Theorem [T] Similarly, in Fig. [7J i.e., K = 5 system with 3x3 MIMO links, the distributed 
IA solves the DOF upper limit at d — 7, while the proposed one shot solution achieves d = 6. In 
Figs. [6] and [7J the iterative IA, operating at d = 5 and d = 7, respectively, offers higher sum rate at 
high SNRs. However, we observe that at lower SNRs the proposed one shot IA operating at lower DOF, 
i.e., at d = 4 and d = 6, respectively, outperforms iterative IA. This improvement at lower SNRs can be 
intuitively explained as follows. In one shot IA, the receive beamformers in the original network (i.e., Step 
1) are designed to maximize the received signal power, whereas in distributed IA [3 J, the beamformers are 
chosen to minimize interference leakage in both the original and the reciprocal networks. In Fig. [6} the 
partially coordinated distributed IA with d = 5 offers higher sum rate compared to one shot solution with 
d — 4 at SNRs higher than 26 dB. Similarly, in Fig. [7J the distributed IA with d — 7 outperforms the one 
shot algorithm with d = 6 at SNRs higher than 32 dB. Note that for K = 5, the fully coordinated system 
offers very high multiplexing gain, i.e., d = 10 with 2x2 MIMO links and d = 15 with 3x3 MIMO 
links. Hence, to obtain a better resolution for the sum rate curves of the partially coordinated scheme, 
the sum rate curves for the fully coordinated BD based ZF are omitted in Figs. [6] and |7J In Fig. [8j we 
compare the data rate required for exchanging the CSI overhead in the backhaul in the proposed partially 
coordinated precoding scheme with non-coordinating interference channel model and the full coordination 
model [fT3l - [[T5l . as discussed in section |ni-D| The data rates are expressed in terms of multiples of R, 



where R is the data rate required to send the single MIMO link CSI through the backhaul within the 
allowed latency interval At. The full coordination model requires exponential increase in backhaul data 
rate, whereas, the backhaul data rate of the partially coordinated IA increases linearly with K. 
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V. Conclusions 

In this paper, we introduced a partially coordinated transmit precoding scheme for downlink multicell 
MIMO systems. We transformed the system model of our scheme to an equivalent interference channel 
problem and compared its feasibility with standard interference channel model. We established a sufficient 
condition for obtaining a proper system, in terms of the maximum number of users supported at the 
information theoretic upper bound on DOF, in the proposed partially coordinated IA and the generic 
IA. Furthermore, we introduced a one shot IA algorithm with partial coordination and analyzed the 
maximum achievable degrees of freedom in the system. Simulation results confirmed that, in comparison 
with the generic IA, the proposed partially coordinated IA offers higher throughput at practical SNR levels 
and avoids an uncontrolled number of back and forth iterations between the original and the reciprocal 
networks. 
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Fig. 1. System structure of the proposed partial coordination model in multicell downlink 
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Fig. 2. Sum rate comparison of the partially coordinated one shot algorithm with distributed IA $3$ (i.e., no coordination) and BD based 
ZF [9] (with fully coordinating precoders), for K — 3, with m — n = 2. 
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Fig. 3. Sum rate comparison of the partially coordinated one shot algorithm with distributed IA f3j (i.e., no coordination) and BD based 
ZF [ 9 J (with fully coordinating precoders), for K — 3, with m — n = 3. 
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Fig. 4. Sum rate comparison of the partially coordinated one shot algorithm with distributed IA $3$ (i.e., no coordination) and BD based 
ZF [9 J (with fully coordinating precoders), for K — 4, with m — n = 2. 
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Fig. 5. Sum rate comparison of the partially coordinated one shot algorithm with distributed IA f3j (i.e., no coordination) and BD based 
ZF [ 9 J (with fully coordinating precoders), for K — 4, with m — n = 3. 
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Fig. 6. Sum rate comparison of the partially coordinated one shot algorithm with distributed IA [3] in both partially coordinated and 
uncoordinated scenario, for K = 5, with with m = n = 2. 
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Fig. 7. Sum rate comparison of the partially coordinated one shot algorithm with distributed IA [31 in both partially coordinated and 
uncoordinated scenario, for K = 5, with m = n — 3. 




Fig. 8. Comparison of the data rate required in the backhaul for the exchange of CS1 in the proposed partial coordination model with full 
coordination model and Distributed IA, i.e., non-coordinating scheme, for K = 2 to K — 7. 



